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ABSTRACT 

The four parts of this report are: (1) the purposes 
of evaluation and reporting; (2) relationship to educational 
philosophy and methods of teaching, including historical developaent, 
present reporting practice and criticises of these practices, and 
alternatives to conventional grading; (3) the best way to report 
student achieveaent; and (4) evaluation to inprove instruction, with 
consents on standardized tests e criterion-referenced tests, 
self-evaluation, and promising trends in evaluation. (RC) 
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INTRODUCTION 



.Arc parents, teachers, atul students in your sclmo} satisfied 
with current mctliods uf evaluating aiu! reporting student 
achievement:' Probably not. Surveys indicate that most sclmol 
systems ar«- currently engaged in, haw recently completed, or 
art- about to liegiu a review or alteration of tlu tr evaluation and 
reporting prim-dmes. Hut change an- often slmrt lived as one 
school or school system adopts a method another sehool has just 
abandoned. Many schools go around in eireks. eventually return- 
ing to a ^porting system that had int ii discarded sometime 
earlier. 

Wh\ despite many years of experimentation and research, 
do many problems and issm's related to evaluation and reporting 
remain unsettled': 1 How ean the classroom teaclkT determine 
what evaluation and reporting s\stem is l>est for her/his class? 
How can evaluation and nporting .systems serve to improve 
teaching and fac ilitate learning':' 

To help find at least partial answers to these questions am] 
to stimulate further study, this report is divided into four parts: 
{ 1) The purposes of evaluation and reporting. (2) Their devel- 
opnient. past atul present, in relation to different educational 
philosophies and methods of teaching. (3) The l>est way to re- 
port student achievement, and < 4 ) The types of evaluation that 
can improve instruction. 



PURPOSES OF EVALUATION AND REPORTING 



One major reason for the great difficulty in finding an ideal 
evaluation am! reporting system is the multiplicity of purposes 
and interests of different people in t!u*se functions. A comprehen- 
sive and widely i{ta>ted definition of puqxw* 1 * is William L, 
Wrinkle s. published in 1H47 in his classic work. Improving, Mark- 
ing and Reftorting Practices. His definition follows: 

Purposes of Marking 

1. Adminisirativv functions. Mi.-ks indicate* whether a 
student has passed or failed. winder he should he pro- 
moted or required to repeat tlte gr.de or course, and 
whether lie should l>e graduated. They are used in trans- 
ferrins; a student from one school to a other and in judg- 
ing candidates for admission to college. They mav !x* usee! 
by employers in evaluating prospective einployc*es. 

2. (Guidance functions, Marks are used in guidance and 
counseling in identifying areas of special ability and ina- 
bility, in deciding on the advisability of enrolling the stu- 
dent in certain courses and keeping him out of otlters, ami 
in determining the number of courses in which he may be 
enrolled. 

3. Information functions. Marks are the chief means 
employed by tin* school in giving information to students 
and their parents regarding tlie students achievement, 
progress, and success or failure in his school-work. 

I. Motivation and discipline functiom. Marks are used 
to stimulate students to make greater effort in tlicir fram- 
ing activities. Tin y are used for the same purpose* in de- 
termining eligibility to honors of many different kinds such 
as participation in school activities, eligibility to play on 
the team. mcml>crship in selected groups, the winning of 
scholarships, etc. <3W. pp. 31-32) 
These four classifications are not mutually exclusive; the) 
overlap. For instance, the use* of marks for awarding a scholar- 
ship provides motivation, but it also serves an administrative 
function. Thus, almost \\\\ one function may l>e unde r more than 
one* classification. 
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Grading h Nat Bvatuating 

Much confused thinking has been caused by lack of distinc- 
tion between evaluation and marks or grades. Very often in edu- 
cational literature the terms have been used interchangeably, 

Evaluation of student achievement assesses how well stu- 
dents have achieved curriculum objectives. Grading, or marking, 
is not evaluating. Grading is merely one way to report an evalu- 
ation. A smile, a frown, a spoken or vritten comment, for exam- 
ple, are other ways to report an evaluation, as is selecting a 
certain student to lead a classroom discussion* And a computer 
printout profile of a student's accomplishment of dozens of spe- 
cific behavioral objectives is still another. 

What's Goad far the Goose . . . 

An evaluative or reporting technique that suits one purpose 
or person may not suit others, or it may even be in direct con- 
flict with other purposes or needs of other persons. For example, 
conventional hotter grades, A*F, have served as an administrative 
convenience to colleges in selecting students. Studies have shown 
letter grades in high school to be a fairly reliable indicator of the 
kinds of grades students will get in college. (18) But the grades 
students get in school are not good predictors of job performance 
in the world of work outside the school. { 15, 21 ) 

Or, to take another example, motivation may be enhanced 
for students who receive high marks or grades but may be de- 
stroyed for students whose self-confidence is undermined by fre- 
quent low grades. (3) 

Purposes of Evaluation 

The distinction between evaluation and grading is empha- 
sized in the 1967 yearbook of the Association for Supervision and 
Curriculum Development, Evaluation as Feedback and Guide. 
(4) This book is based upon the assumption that the major pur- 
pose of evaluation is to provide feedback upon which to base 
individual and institutional decisions to improve education. The 
authors state, The test of an evaluation system is simply this: 
Does it deliver the feedback that is needed, when it is needed, 
to the persons or groups who need it?" 
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If any system of evaluation is to meet this test, it must— 

1. Paeiiitate self-evaluation. 

2. Encompass every oi>jeettve valued by the school. 

3. Facilitate learning ami teaching. 

4. Produce records appropriate to the purposes fen* which 
records are essential, 

5. Provide continuing feedback into the larger questions 
of curriculum development and educational policy. (4, pp. 
4-6) 

The ASCD Yearlxiok Committee concluded that our present 
system of evaluation and reporting does not satisfy any of these 
criteria and should be replaced with an entirely new system of 
diagnostic teaching that provides for considerable student self- 
evaluation and cooperative evaluation by the school faculty. 

RELATIONSHIP TO EDUCATIONAL PHILOSOPHY 
AND METHODS OF TEACHING 

Different practices in evaluation and reporting result chiefly 
from different educational philosophies and goals and from dif- 
ferent methods of teaching and learning. 

If our educational goal is to convey a fixed body of knowledge 
to all students through teacher-directed methods, then a per- 
centage mark on a written test indicating how mud) of the body 
of knowledge was acquired is appropriate. But if the goal of tin? 
school is to develop the full potential of each individual student 
largely through self-direction, other means of evaluation and re- 
porting must be devised. 

When percentage marks, national standardized tests, and let* 
ter grades became firmly entrenched in the first two decades of 
this century, tin* goals of our schools were largely subject ori- 
ented. High school and college programs were based on elitist, 
highly selective standards. It was assumed that the predominant, 
middle class culture should Ik- the standard for all. 

Since then American education has been washed bv several 

0 

different waves of philosophy and method, each creating pres- 
sures for different kinds of evaluation ami reporting. Although 
these wave* of change dominated certain periods of time, no one 
of them has cut covered all schools at tin 1 same time or to the 
same extent. Today, for example, some schools that are corn- 
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pletely immersed in a wave erf humanism may be, seeking to 
eliminate grades or external evaluations altogether; some schools 
still concerned with implementing the new emphases on the sub* 
ject matter disciplines are trying to find more precise ways of 
defining and evaluating prescribed learnings, while most schools 
are working under a combination of these and other approaches 
to education and are therefore seeking an appropriate combina- 
tion of evaluation and reporting devices. 



Historical Development 

To place letter grades and other aspects erf evaluation and re- 
porting in perspective, it would be helpful to review how we got 
where we are now, 

Marks and grades are relatively new in the history of educa- 
tion. In tlu» early ISOOs in the one-room school where children erf 
all ages and abilities studied together, students often moved from 
mie level of work to anther in each subject as soon as the teacher 
determined that thev had mastered the prerequisite knowledge 
or skill. 

Some were slower than others, hut this did not cause diem to 
be labeled as failures. They just took longer or received more 
help, as required. The teachers' reports to students and parents 
consisted of spoken or written descriptions of what the students 
had learned. Students* knowledge was tested not for the purpose 
of assigning grades but to demonstrate progress and for those 
few who were planning to go to college to determine what addi- 
tional instruction was necessary to prepare them for college work. 

Competition for admission to college did not become a 
major problem until after the rapid expansion of public high 
school enrollments in the last quarter of the nineteenth century. 

As elementary and high schools grew in size, the students 
were divided into different levels or grades in the effort to pro- 
vide more efficient mass instruction. Percentage marks began to 
Ik* widely used for two major purposes: to help teacher* classify 
students and to help colleges screen their applicants. 

The mark became the major basis for determining whether a 
student was to he promoted to the next level and whether she/he 
was to be admitted to college. 
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The first major challenge to the marking system came from a 
study- conducted by Starch and Elliott in 1912 which demon- 
strated very dramatically the subjective nature of supposedly 
objective grades. It was found that when the same English test 
paper was submitted to different teachers for grading the numer- 
ical scores assigned varied as much as 47 points cm a lOfcpoint 
scale. (35) In a later study there was a range of 67 points in 
marks assigned by different teachers to the same geometry test! 
(34) As similar findings from both other studies and informal 
observation confirmed the lack of precision of percentage grades, 
educators began using scales that had fewer but larger categories* 
Some used a three-point scale: Excellent Average, or Poor, Some 
used a five-point scale; Excellent. Good, Average, Poor, or Fail- 
ing, symbolized by A. B, C, D f F. The five-letter grading system 
came to he tin* one most prevalent in high schools and infiltrated 
elementary schools as well 

Between 1900 and 1920 the emphasis was upon making the 
meaning of grades more objective and dear* During this period 
scientific development of objective tests began- The use of stand- 
ardized tests based on national norms to screen men fen- military 
service during the First World War stimulated the use of such 
tests in schools. 

The two main types of tests used were achievement tests in 
the major subjects ami IQ tests. Many educators interpreted IQ 
score- to be permanent indicators of general intelligence, and IQ 
became synonymous with academic ability. In recent years, bow- 
ever, many studies have shown that IQ tests measure only a few 
aspects of intelligence, are subject to substantial change during a 
persons life as a result of her/his particular experiences, and are 
particularly invalid for minority groups whose cultural or lan- 
guage background is different from the "average American-" 
Furthermore. IQ scores can sometimes be affected by merely 
studying for the test. 

Paralleling the growth of standardized testing was a move- 
ment to make grading fairer by allocating grades for student 
achievement on the normal curve. According to one interpreta- 
tion, 3 percent of students should receive A 9 s 9 24 percent B's, 46 
percent Cs. 24 percent Ds, and 3 percent ETs. Thus, try as they 
might a certain percentage of students were doomed to fail 
Furthermore, since the national distribution of abilities is not 



evenly distributed in every classroom or school, students of like 
ability and achievement receive different grades according to 
where they may happen to live. 

Some educators in the 1620's tried to eliminate rather than 
tinker with grades. Some schools substituted verbal descriptions 
of the students' abilities. Some tried pass/fail systems. Others ad- 
vocated a mastery approach in which evaluation would seek to 
determine only whether or not a student has mastered an educa- 
tional objective. 

The Progressive Education movement beginning in the late 
1920s and reaching its peak in the 1930s stimulated much ques- 
tioning of and experimentation with various means of evaluation 
and reporting-secking to foster growth of the total person, indi- 
vidual differences, and individualized instruction. The Progres- 
sives believed that education should bring about changes in bc- 
havior. not just facilitate the memorization of facts. 

A comprehensive report on evaluation based on the Eight 
Year Study of the Progressive Education Association stated that 
(a) evaluation and recording must be directly related to each 
schools purposes and philosophy, (b) an evaluation program 
must be comprehensive, including appraisal of progress towaid 
all the major objectives of the school, and (c) teachers should 
participate in the constntction of all instruments for evaluation 
and forms for records and reports. (.12) 

The Progressive Education movement stimulated experimen- 
tation with anecdotal records, student-teacher and parent-teacher 
conferences, letters to parents, check forms, and evaluation in 
terms of achievement of educational objectives, including behav- 
ioral objectives. 

During the 1930s. William L. Wrinkk- and his colleagues at 
the Colorado State College at Greeley tried and discarded almost 
all of the alternate types of reporting discussed in this report. 
They concluded that evaluation of student achievement should 
be in terms of behavioral objectives. But they listed only 15 broad 
objectives on reports to parents, keeping the specific objectives in 
each subject for internal use. Interestingly, the reports included 
an evaluation of how well students had achieved the objectives 
for each subject on a five-point scale. Words were used, but their 
meaning could easily be translated into conventional A-F fetter 
grades. The faculty had intended to keep these evaluations for 
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confidential student records only, but the students requested that 
they be sent to parents. 

"You are fortunate that you can begin your work without 
having to go through ten year* of learning the hard way," 
Wrinkle wrote to teachers interested in in proving evaluation and 
reporting. "But don't get the* idea that your job is going to be 
easy. It won't be. Often the* understanding and application of a 
verv simple idea is verv difficult— because it is so simpler (39, 
p. 92) 

The Second W orld War interrupted the fi nnent of change 
and experimentation brewed by the Progressives. Science ami 
math became weapons in the war and in the* postwar struggles 
with the U.S.S.R. for power ami prestige. There was a movement 
toward a return to fundamentals in education with emphasis on 
the subject rather than the learner. In many schools this was ac- 
companied also by a return to the old. academically respectable 
fetter grades as a presumed means of motivating students to 
work harder. 

The major thrust of new development was to redesign the 
curriculum* erf the* subject matter disciplines. Although profes- 
sional educators wen* given some role in the revision process, 
control was in the hands erf erudite scholars of the various disci- 
plines. The) sought to put more depth, more significant concept 
building, more systematic study of the structure of each disci- 
pline into the new curriculums. 

Another significant postwar development was programmed 
instruction. Programmed instruction brought more thnn teaching 
machines to the schools. It brought the philosophical influence of 
behavioral psychologists, such as B. F. Skinner, who believe that 
a high degree of control can and should be imposed on individ- 
uals to shape their behavior to predetermined goals and that 
learning can be very systematic, sequential, and efficient The be- 
havioral psychologists gave new impetus to the application erf 
behavioral objectives in curriculum planning and evaluation on a 
much more rigid basis than the objectives advocated by Tyler 
and other early Progressives. 

Although the new curriculums ard programmed systems of 
instruction have met with varying success, most of them failed 
to live up to the expectation that they would bring dramatic im- 
provements to American education. Perhaps because of disap- 
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pointment, perhaps because they lost interest or hope, academic 
scholars allowed control of the schools to drift back into the 
hands of professional educators. 

At any rate, during the 1980s, the influence of the academic 
elite was overwhelmed by the influences of the civil rights move- 
ment which sought equal opportunity for minorities in educa- 
tion, by student activists seeking more relevant education, and by 
a renaissance of the basic tenets of Progressive Education under 
a new title; Humane Education. 



Present Reporting Practice 

In the 1970s, as more schools seek to reemphasizc growth of 
the total person, respect for individual differences, and individ- 
ualized instruction, they are attempting to devise and apply 
appropriate means of evaluating and reporting pupil progress. 
But as a glance at Table I will indicate, the same old means of 
reporting were generally in effect in 1971, when NEA Research 
last surveyed pupil progress reports to parents. 

A classified scale of letters is the most popular procedure 
used in junior high (82.4 percent) and senior high schools (83.8 
percent). Teacher-parent conferences appear to be most popular 
in kindergarten, used by 85.4 percent of the systems, and in grade 
1* used by 77.2 percent of the systems. In grade 4, each proce- 
dure is used by about 70 percent of the systems. Note that some 
systems must use both procedures because of the relatively high 
percentages of each. Some of the other procedures must also be 
used in the same systems. 

Descriptive word grades, although used by about one third 
of the systems in kindergarten and grade 1, are used infrequently 
in the junior and senior high schools. A formal letter or written 
paragraph to parents is more popular than descriptive word 
grades in the high schools. The other procedures are used by 
relatively small percentages of the systems at each level. The 
data showed no differences in the procedures used by different 
size systens. 
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Table I.~Pupil Progress Reports to Parents Used by School 
Systems for Selected Grade I-evels, 1971. 
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Source: 

National Education Association. Reseaa*h. "Reporting Pupil Progress to 
Parents 1966 am! 1971. " SEA Rrmvnh Memo 1972- W. Washington, DC: 
the Association, Scptetnln-r 1972, p. 1. 



Checklists, perhaps because they are generally considered to 
supplement a basic report, were 1 not listed in this survey ques- 
tionnaire, but other studies have shown that they are used by 
many schools at all levels. 



Criticisms of Present Reporting Practice 

Obviously there is wide diversity in reporting practices in 
schools of the United States. Such diversity is bound to cause 
problems in a society so mobile that one out cf five families 
moves each year. But some educators take comfort from the 
situation by reasoning that tin* diversity indicates a willingness 
on the part of many schools to continue tc experiment in a search 
for a procedure that desert!** adequately all the dimensions of a 
pupil's physical, mental, and emotional development. Further- 
more, no cither aspect of American schooling is uniform* As long 
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as we have great differences in educational philosophy and prac- 
tice, w shall continue to have difference* in evaluation and re- 
porting of student progress. 

The Tail That Wags the Dog 

However, despite many variations, letter grades or marks 
definitely dominate our schools from elementary school through 
college. They dominate them not only as a means of evaluation 
and reporting, hut in some schools they dominate the entire 
learning process. Many educators now believe that instead of 
being an adjunct or aid to the learning process, grades have be- 
come a major obstacle to teaching and learning. ( 17, p. 78) 

What's Wrong with Letter Grades? 

Following is a summary of five of the major negative criti- 
cisms of letter grades as a means of evaluation and reporting: 

L Grades rather than learning have become the goal of 
students. This has greatly arrowed the real curriculum of the 
school, encouraged cheating, and discouraged desirable learning 
attitudes and habits. (17) 

2. Grades do not tell us mything Sfwcific. Since they are 
derived from a composite of main factors, we can't tell, for exam- 
ple, whether a student who receives a C in English can express 
thoughts very clearly but cannot spell correctly, whether the re- 
verse is true, whether the child is strong in literature or weak in 
grammar, or whether she he is average in all her/his English 
skills. (7) 

3. Grades have differetit meanings Itecause of subjective 
factors. For example, studies have shown that teachers generally 
tend to grade girls higher than !>oys and middle class children 
higher than lower class children for identical achievements, (10, 
29) 

4. Grades have differetit meanings because they are Itased 
tifwn different criteria. In different schools, and even within the 
same school or classroom, a grade may represent an attempt to 
measure achievement in relation to one's classmates, in relation 
to national norms, in relation to each child's ability, in relation to 
teacher expectations, in relation to pupil effort, or a combination 
of some or all of these factors. (39) 
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5. Low grades have serious adverse effects on some stu- 
dents self* once pt.s that not only interfere with learning but h^ve 
serious negative lifetime effects cm personality and mental health. 

<») 



Recent Attacks on Marks and Grades 

In recent years the grading system has been strongly at- 
tacked by organized groups as well as by individual researchers 
and writers, Here are just a few examples: 

In the 1960s many student demonstrations and revolts on 
college campuses sought, among other things, elimination or re- 
form of the grading system. 

In 1967 the Yearlxxik Committee of the Association for 
Supervision and Curriculum IX'velopment. in Evaluation as 
Feedltack and Guide, declared that the prevalent marking grad- 
ing system is largely responsible for making our system of evalu- 
ation grossly inadequate in providing valid feedback to the indi- 
vidual learners tliemselves and to the larger units of the school 
system and for gross exaggeration of the more mechanical, easier- 
to-measure features of education. M The end result is not simply 
!>ad evaluation; it is distorted teaching and learning/* (4, p. 15) 
A staff report of the Xattonai Education Association, Schools 
for the 7(fs and Betjomh A Call to Action, published in 1971, de- 
clared. "Grading— the process of appraising youngsters in relation 
to one another, ranking them on a ladder from best to worst- 
is not onlv unneccssarv to instruction but is oftt n nositivelv harm- 
ful/* (24. p. fri) 

And in the same year the Board of Directors of the National 
Council of Teachers of English declared: 

Reporting a child's progress in the early years should be 
done through methods other than the assignment of a let- 
ter or numerical grade. Rather, the reporting of a child's 
progress should l>e through regular conferences between 
teacher and student based upon anecdotal records, com- 
parative samples of the child's own work, tin* teachers 
estimates of the child's growth in skills, and his growth 
toward achieving other goals that the community and the 
school might have set . . . 
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After tlu» early ycani, at all educational levels only pass- 
ing grades . . . slwukl Ix» recorded on a student* perma- 
nent record. ... (26) 

Why Have Marks and Grades Persisted? 

Why, despite these many criticisms, do letter grades persist 
as the major means of reporting student achievement? 

1. Inertia to clwnge anything in life makes people cling to 
familiar things even when they are not satisfied with them. 

2. Familiarity with grades makes it difficult for parents, 
students, and teachers to conceive of evaluation in other terms, 
even when changes are attempted. Many educators who haw 
tried to .substitute another means of reporting to parents have 
had reactions like the following: After a 10-niinute parent-teacher 
conference or after an examination of a descriptive letter or 
checklist of learnings achieved, a parent says, "Tins is very inter* 
esting.' hut what does it realhj mt an? Is Maria a B student a C 
student, or what?" 

3. Grades represent simple etmeept* % and people usually 
prefer the simple to the complex. 

4. Grades arc east/ far teachers to prejnjre compared to 
most other means of evaluation that have lx*en tried. Several of 
the alternatives that have !>een proposed are so time-consuming 
that most teachers find it impossible to carry them out properly 
over an extends! period of time. 

5. Grades are a spur to classroom jwrformance for stu- 
dents. Studies have shown that in conventional school situations 
when grades are eliminated for some students and retained for 
others, work output of the ungraded student decreases in com- 
parison to the graded student's. H4) 

6. Grades are used to srrcen students for college admis- 
sion. Mow that many parents want their children to go to college, 
they start worrying al>out whether tlieir offspring will have the 
grades for college even liefore they enter school, 

7. No substitute for grades has won continuing support. 
Except for a few limited applications, such as parent-teacher con- 
ferences in the kindergarten, stilistitutcs for grades have proven 
to Ik* too time-consuming, too difficult to use or understand, or 
have not found continuing acceptance for other reasons. 
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Alternatives to Conventional Grading 

Following is a brief description ami analysis of each of the 
major alternatives or supplementary means of reporting student 
achievement that schools are trying currently. Since most of them 
have been tried since the 1920s, considerable research and ex- 
perience have accumulated to help us assess their advantages 
and disadvantages. 



Oval Grades and Ratings 

As indicated earlier, a single grade mark could mean any 
one of a munlier of things. It could indicate the achievement of 
a student in terms of an absolute scale; in terms of other students 
in her his class, school, school system, or nation; or in terms of 
her/his own ability, growth, or effort. Some schools attempt to 
define clearly upon which of these fat-tors a grade is based. Some 
schools deliberately combine several of the factors. And some 
teachers consciously or unconsciously combine factors even if it 
is not official policy. 

To reduce such confusion, some schools grade on more than 
one basis. A common type of dual grading used in both elemen- 
tary and high schools is the achievement 'effort combination. 
Achievement is usually rated A-F and effort O for outstanding, 
S for satisfactory, or U for unsatisfactory. 

Another type of dual grading, used mostly in elementary 
schools, employs one rating to indicate individual progress and 
another to indicate level of performance. For example, A 4 mav 
indicate that a sixth-grade student has achieved outstanding 
growth in reading, but is reading at a fourth-grade level. 

There are many otlk-r variations on this theme, with some 
schools giving three or more grade s in each subject- 
Theoretically, dual grading systems should be superior to 
single-grading systems l>ecause they describe more than one 
aspect of achievement. But studies have shown that there is a 
halo effect which causes teachers to tend to give students who 
achieve high or low grades in terms of giade norms similar 
grades in terms of ability. (13) 
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Rating Scales Other than Percentages or A-f 

In efforts to reduce emphasis on working for grades and the 
discouraging effects of poor grades, some sciioolg have used rat- 
ing scales with only two or three steps and have tried to express 
poo: ratings euphemistically. 

Examples of such two-point rating scales are— 

S— Satisfactory or G— Good Progress 

U~- Unsatisfactory N— Nfrvds To Improve 

But tlk'rc is a tendenex to expand the number of ratings to 
three, four, or more, such as- 
Outstanding 
Satisfactory 
Is Showing Growth 
Needs Improvement 

This seems to l*e another case of history repeating itself, for 
as Wrinkle wrote in 11*47: 

The experience of many schools that changed to the S 
and l f marking system was that with the removal of the 
possibility of an A, B, or C many students l>ecamc con- 
cerned only in staying just over tlie border in the S area. 
As a corrective move to recov er the stimulation which the 
A had provided, they added a third letter, usually an H 
which stood for "Honors/' Some affixed + and — signs to 
the II and S (II. S + . S, S- ), and they were back 
almost where they started; tliey then had a S-point mark- 
ing system. 

Progress in the improvement of marking and reporting 
practices cannot lx» achieved by the mere manipulation of 
symlwls. Alxiut the l>est that can be said for the substi- 
tuting of S and U or H. S. and V for A BCDF marks is 
that thereafter the* school is brought face to fact* with the 
fact that what it thought was a problem in marking is 
fundamentally a curriculum problem. If students quit 
working when the incentive of marks is removed and the 
staff is unwilling to admit that they can Ik* stimulated to 
learn only by the use of such extrinsic pressures, then the 
staff has discovered something fundamental. ... (39, pp. 
51-52) 
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Parent-Teacher Conferences 



Many educators believe that parent-teacher conferences are 
the ideal method of reporting to parents. If .properly handled, 
they can provide individualized, diagnostic, and tactful two-way 
communication. To reach tf us ideal the teacher needs to keep 
and refer to written evaluative records such as anecdotal records, 
tests, checklists of student achievements, and samples of pupils' 
work. 

A written record of the conference should be kept for future 
reference. Some school systems provide for a written report to be 
prepared by the teacher in advance as a guide to discussion and 
for presentation to the parent during the conference. 

Obviously, a major drawback of the parent-teacher confer- 
ence is the great amount of time it takes. It is almost impossible 
to use it on a regular basis in secondary schools where each 
teacher may teach MX) to 150 students. Its use as the major means 
of reporting is generally limited to kindergarten and the first 
grade. However, it is being used increasingly at all levels to sup- 
plement written reports. 

Narrative Reports 

Narrative reports range from teacher-written letters to com- 
ments under various headings on a report card. Often narrative 
reports are alternated with parent-teacher conferences m kinder- 
garten and primary grades. Such reports offer some of the same 
advantages and disadvantages as parent-teacher conferences. 
They can lie tailored to the particular child and parent and can 
give meaningful details of progress and problems. However, 
writing a good letter or comment of this type is difficult, and 
teachers often resort to vague, general language. 

Some schools have lx«en experimenting with computers to 
help write letters to parents. The teacher, who knows the com- 
plex particularities of each student's achievement and potential, 
selects from a large number of preeoded comments that have 
been programmed into the computer. The computer then prints 
out tin- letter or report form. When properly used, this method 
both saves a great deal of teacher time for instruction and per- 
mits detailed reporting of each student's progress. At present the 
major complaints against this method have been awkward combi- 
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nations ot words ami phrases, sonic gross errors, and under utili- 
zation of the sophisticated technology. If these flaws can be 
worked out, computers may become a generally useful and prac- 
tical aid to effective recording and reporting of student achieve- 
ment 

Checklists 

Checklists are shortcuts to tlx* writing of reports by teachers 
in which preprinted statements of student cliaracteristics, learn- 
ings, or behavior are cheeked or marked with some other evalua- 
tive symbol. Using checklists is the simplest way to report the 
most information with the least expenditure of teaelier time and 
effort. (39) 

There are mam- types of checklists. They range from— 

1. Vague descriptions of a few character traits and study 
habits supplementing conventional reports on academic subjects 
( _ gets along well with others), through 

2. Positive ev aluations used to report what the student has 
achieved ( reads with understanding), to 

3. Precise statements of behavioral objectives for all school 
subjects and goals: (Given a human skeleton, the student must 
be able to correctly identify by labeling at least 40 of the follow- 
ing bones: , . ). 

Behavioral Objectives 

The use of behavioral objectives as a guide to teaching and 
evaluation has been one of tin 1 most publicized new concepts in 
education during the 1960s and early 1970s. Because of the 
difficulties many teachers have in implementing this much talked- 
about concept and because of several different interpretations 
and controversial issues, more space will be devoted here to 
analyzing this means of evaluation and reporting than to any of 
the other alternatives. 

Purist proponents of behavioral objectives hold that all in- 
struction can and should !>c planned to bring alx>ut controlled 
changes in behavior, that the learner must l>e able to demonstrate 
the change through performance, and that it is possible to ob- 
serve and measure the success or efficiency of the student's per- 
formance. 
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Guides to writing behavioral objectives usually emphasize 
specificity, and most definitions of l>chavioral objectives follow 
Mager s three criteria. 

A behavioral ol>jective states- 

• An action performed by a student. 

• The condition under which the performance is to occur. 

• The criteria of acceptable performance. 

For example: "Given a human skeleton [condition], the stu- 
dent must be able to correctly identify by labeling faction] at 
least 40 of the following liones . . . [criteria]." (23) 

According to Gets, "Most authors pay special attention to tin* 
writ in the statement of an objective. Such 'non-observable' verbs 
as think, appreciate, enjoy, know, an* unacceptable while specific-, 
observable action verlw (e.g.. writes, assembles, states) are given 
a stamp of approval." ( 12 ) 

However, mam educational scholars such as Ralph Tyler, a 
pioneer in tin- study and use of behavioral objectives, include in 
the definition of behavior "all kinds of human n-actions like think- 
ing and feeling as well as overt reactions It includes attitudes 

toward subjects or things. It encompasses being able to solve 
problems ami to acquire intellectual skills like reading or phvsieal 
skills like running." (.31) 

It takes a large number of specific behavioral objectives to 
describe goals and accomplishments in any subject. For example, 
the goals for a semester of mathematics may take 100 or more 
objectives to describe. This specificity facilitates use of ix'haviora! 
objectives for internal evaluation to improve learning. An objec- 
tive stated specifically (e.g.. "Given a set of figures, the student 
will identify those which contain a right angle. ") is useful to the 
student in guiding l»er/his learning and developing self-evalua- 
tion, and is useful to the teacher in planning diagnostic teaching. 

However, as Wrinkle found, it is probably inadvisable to re- 
port to parents in terms of all the detailed spk-ific course objec- 
tives. Many of the objectives would not be umlerstocxl bv a num- 
ber of parents, while those who did understand them might at- 
tempt inadvisable corrective measures. ( 39) 

Some scholars Mieve that it is not desirable to use objectives 
that are as specific as those- currently in vogue. Tvler savs: 

I think many current uses of the ti-rni. behavioral objec- 
tive*, imply procedures that are too specific. I believe that 
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the individual human being is able to solve many of his 
own problems and so 1 think thot more of our educational 
objectives should Ik* general in nature-like learning how 
to go about attacking problem*, finding out where the dif- 
ficulties are, getting information, analyzing tin* data, and 
drawing inferences from data. 

Hence, in my view, many behavioral objectives should 
be* set at a considerably higher or more general level than 
tin* c rtninely specific things I find in many current efforts 
to write them. (331. p. 42} 
Otlier observers In-lieve that it is not possible for teachers to 
use as mam specific* objectives as an* getHTally advocated today. 
For example. Frances R. Link and Paul B. Diederich. in a pro- 
posal for a cooperative evaluation program, recommend reporting 
on schoohvidc objectives plus the* "four, five, or six most widely 
recognized objectives of e*aeh field of study— those that teachers 
art 1 able and willing to measure in one way or another at this 
stage*— not the long lists of 100 or mem* nlsjective* that the?y often 
submit as claims that are never substantiated." (4. p. 129) 

Several writer* advex ate different levels of specificity of ob- 
jective* for different purposes. Three levels are most frequently 
suggested. Krathwohl. for example*, classifies e>bje*etive*s as "glol>al, M 
"intermediate" (e.g., for a course ) , and "specific," (each rcpre- 
senting a skill or concept) (20). 

Even . some* of the* most enthusiastic advocates of behavioral 
objectives have found that "there are some important goals which 
we have for e>ur children which arc currently unassessable. To the 
extent that such goals art* extremely meritorious." states \V, James 
Popham, "they are* worth the* risk erf our pursuing them even if 
we cannot reliably discern whether they have be*en accomplished.** 
Popham suggests that in some content-laden classes the propor- 
tion erf nemmcasurahle* goals might be' smaller than in other 
course's such as tl*e humanities anel aesthetics. (27, p. 608) Fop* 
ham ami others now tend to speak of instructional rather than 
Ix'haviorul objectives. 

The*re* are* a number erf educators who take a baste stand 
against the use* erf behavioral objective* for am education other 
than training to do a specific task or to acquire basic skills needed 
bv all students. Some* critics of behavioral objectives tend to be- 
lieve that learning is e>r should In* largely self-directed, unstrue- 



O 99 

ERLC 22 



tared, ami in largo part unpredictable. Some challenge the con- 
cept that all learning should or can lead directly or immediately 
to changes in ltchavior. 

George F. Knellcr says flatly, "learning leads to no particu- 
lar behavior. . . . To use U'havioral objectives in individualized 
instruction is to overlook I he essential differem-es between indi- 
vidual learning, knowing, and behaving.'* ( 19) 

Robert L Ebel writes that thr real objectives of education 
are the knowledge and understanding, the attitudes and values 
that induce changed behavior now or in the future. Ebel con- 
cludes: 

Little that is wrong with any teachers educational efforts 
today can be cured by getting him to define his objectives 
more fully and precisely. We ought not to ask teachers to 
spend much of their limited time in writing elaborate 
statements of their objectives. (9, p. 173) 

Pass/Fail Or Credit /No Credit 

The second most talked about current innovation in evalua- 
tion and reporting is pass/ fail or credit/ no credit 

Pass fail system* haw been tried in colleges and high 
schools— 

1. To encourage students to take difficult courses which 
they otherwise might avoid for fear of lowering their grade point 
average. 

2. To allow students to apportion their study time accord- 
ing to their personal interests and needs, rather than spend most 
of their efforts in working for grades. 

3. To foster free ami creative approaches to learning. 

4. To reduce the anxiety of poor achievers bv encouraging 
them to concentrate on learning rather than on striving for a 
grade. 

Of course the latter objective is not achieved when a student 
is in danger of failing. The credit no credit system reduces such 
anxiety Invause no failing entry is made on the pupils record. 

Some credit no credit plans also offer another advantage. 
They allow a flexible time period in which a student mav com- 
plete the requirements of the course, l/nder this plan most stu- 
dents are assured not just <rf freedom from failure, but of even- 
tual success. 
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Some people think of l>ehavioral objectives and credit no 
credit system* as opposite*, representing control and frredom, re- 
spectively. Hut then* is uti increaMfig tendency toward marriage 
of these two seeming opposite*. XKAs Schools for the 7(fs states 
that though "operationally stated objectives* and "evaluation on 
the basis of performance, not time," were developed independ- 
ently, *\ . . these ideas an* not alternative ways of evaluating; 
ratlier in a complete scheme of e\ ablation, they can l>c used to- 
gether within the instructional framework to diagnose— teased on 
the concept that schools exist not to judge children, but to enable 
them to succeed/* { 23, p. 64 ) 

One reason for resistance to pass fail or credit no credit sys- 
tems has l>een the fear that colleges will not give favorable con- 
sideration to students who do not have conventional grade rec- 
ords. However, the authors of WAD-JA-GET? The Grading Game 
in American Education say that several survey* have indicated 
that most colleges art* willing to accept teacher evaluation letters 
and student self -evaluations, in lieu of the usual grades and class 
rank. (17, p. 314) 

The authors of \VAD-}A-GET? present an interesting pro- 
posal. They suggest that a high school allow individual teachers 
and students to choose* whether they wish to work in a conven- 
tional grading system or in credit no credit courses. Teachers 
may l>e allowed to try some courses of each type if they wish, 
but students must choose all courses in either traditional grading 
or credit no credit to avoid the* possibility of taking unfair advan- 
tage of tin* situation bv only taking difficult course* for credit no 
credit (17) 

WHAT'S THE BEST WAY TO REPORT 
STUDENT ACHIEVEMENT? 

The preceding discussion of the pros ami cons of various 
means of reporting student achievement should make it dear 
that neitlicr research, experience, nor theory have provided any 
single widely acccptahk* means of reporting. But though we find 
no simple solution, we do have some bases for making intelligent 
choices. 

In most school situations a combination of several different 
types of reporting is needed to reflect current philosophy, prac- 
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tices, and expectations. Because they an- so strongly entrenched 
in our school* and our culture, letter grades will probably con- 
tinue to lie used as one means of reporting pupil progress even in 
situations when* other means of reporting that an* inore consist- 
ent with current philosophy, goals, and teaching methods are 
used also. 

Classroom teachers have often felt frustrated because they 
had no choice or voice in selecting nutans of reporting, but in- 
creasingly, teachers are represented in committees considering 
schools- or system-wide changes in reporting am! sometimes they 
are even given an individual choice. Kven if you have no choice, 
you can make the !>est of the reporting system you have by under- 
standing its meaning, strengths, limitations and relationships to 
your teaching program and by helping students and parents to 
share* such understanding. You can try also to use means of t- abl- 
ation in your classroom not only to provide data for reposing 
but also as a means to improve instruction. 

EVALUATION TO IMPROVE INSTRUCTION 

Evaluation ami reporting are so closely interrelated that it is 
difficult to consider them entirely separately. Thus, several major 
if ues regarding evaluation have been discussed in the preceding 
sections on alternative reporting practices, particularly in the dis- 
cussion of behavioral objectives. But, as emphasized early in this 
report, evaluation should !*• considered primarily as a means of 
constant feedback for improvement of instruction rather than 
merely as a basis for occasional reports <if student progress. 

Although teachers use- observation of class participation, 
anecdotal records, ami student accomplishment* in projects and 
reports for evaluation, tests-informal or tcacl>cr*madc and stand* 
ardized-have usually served as the most frequently used means 
of evaluation. 

Standardized Tests 

Until recently national standardized tests dominated the 
evaluation scene and greatly influenced the nature of teacher- 
made tests. Intelligence, aptitude, and achievement tests were all 
normative-based. These tests were designed to determine tow 
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students ranked in relation to the national average with an equal 
number of scows distributed, according to the normal curve, 
above and Ik'Iow the median. Thus, in normative-based testing, 
half of the students must 1h § below uonual. 11 teaching and learn- 
ing were to improve so that most ehildreu reach<*d or surpassed 
the median score on a partic ular test, the* test would have to be 
made harder so that half the students taking the test would 
always fail. (6) 

During the ittfiO's ami early 70s iy tests were severely criti- 
cized because the)* are "biased against those who are economi- 
cally disadvantaged and culturally and linguistically different, 
and especially against all minority groups.** (26, p. 26 ) Further- 
more, M . . . tlie use of the typical intelligence 1 test contributes to 
what has come to l>e termed *tlic self-fulfilling prophecy,' where- 
by students* achievement tends to fulfill the expectations held bv 
others-" (37. p. 54) 

As increased emphasis has been placid on writing specific 
learning objectives for local school systems, schools, classrooms, 
and even individual students, standardized tests have become 
ever more unsatisfactory as evaluative instruments, for they tell 
more about how students rank in relation to each other titan what 
they have accomplished in relation to their goals. 

Dissatisfaction with standardized tests became so strong by 
1972 that a resolution was passed that year by the NEA Repre- 
sentative Assembly to encourage "the elimination of group stand- 
ardized intelligence, aptitude, and achievement tests to assess 
student potential or achieve.nent until completion of a critical 
appraisal, review, and revision of current testing programs." A 
Task Force on Testing was appointed to study the situation and 
to prepare a final report for the 1975 Assembly. 

An interim report of the Task Force in 1973 concluded with 
this statement: 

In summary, tlie Task Force believes that the major use 
of tests should tie for the improvement of instruct ion- for 
diagnosis of learning difficulties and for piescribing learn- 
ing activities in response* to learning needs. They must not 
be used in any way that will lead to latx*ling and classify- 
ing of students, for tracking into homogeneous groups as 
the major determinants to educational programs, to per- 
petuate an elitism, or to maintain some groups and indi- 
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viduals "in their place* near the bottom of the socioeco- 
nomic ladder. In short, tests must not be used in ways that 
will deny any student full access to equal educational 
opportunity. (37, p. 54) 

Criterion-Referenced Tests 

Partiallv in response to such criticisms and partially in re- 
sponse to new developments in curriculum and method* of in- 
struction, such as individualized instruction, national test makers 
as well as state and local school systems are beginning to develop 
new lands of tests called "criterion-referenced tests." Criterion- 
referenced tests, or the similar "object ves-referenced*' tests, have 
also been advocated and used lately for evaluating the effective- 
ness of teaching as part of the accountability movement. 

Criterion-referenced tests seek to determine how well stu- 
dents have mastered their specific learning objectives rather than 
how they rank with other students. Their major purpose is to 
measure individual progress and identify needed additional expe- 
riences to improve Warning. 

The National Assessment of Educational Progress utilizes 
criterion-referenced tests. The individual teacher who uses very 
specific learning objective* as a basis for creating teacher-made 
tests is also utilizing criterion-referenced testing. 

William F. Brazziel describes the potential for use of crite- 
rion-referenced tests as follows: 

In an improved situation utilizing CRT, the teacher 
would administer a locator or formative test tising material 
to be mastered as criteria and would teach the children for 
a year. Over this period, he would have administered 
many summative tests indicating master)' of the units and 
skills he set out to teach during the year. He might give a 
final comprehensive summative test. Under this system, the 
children would end the year with comprehensive, explicit 
records that would indicate what they set out to learn, 
what they had learned, and the point at which learning 
should begin in the next school year (8, p. 53) 
However, in actual practice the distinction between criterion- 
referenced and norm-referenced tests may be quite blurred. Some 
test makers use very similar procedures to construct items, often 
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the same items, and use test statistics designed for norm-refer- 
diced items in order to select items for criterion-referenced tests. 
There are no clearly defined and commonly agrc««d upon proce- 
dures for constructing criterion-referenced tests, and in fact niativ 
criterioii-referenceil tests are actually norm-referenced tests in 
disguise. The distinction is often a matter of emphasis rather than 
l>eing clear-cut. 

Some writers claim that mither criterion-referenced teats nor 
objectivcHi-refereiKH^l ti-sts eliminate the most common deficiencies 
of tests in general. Tiiev both still measure short-term retention 
and simple tasks at the expense of long-term retention, relearning 
abilities, and higher-level thought processes. (33) Complex per- 
formances are so difficult to measure that simpler tasks are sub- 
stituted as test items. Billet's own categories of mental imagery, 
imagination, aesthetic appreciation, and moral sensibility are left 
unmeasured. 

Self-Evaluation 

Even more significant than new kinds of tests are recent 
trends in school programs that depend less upon formal testing 
of any kind and more* upon student self-evaluation as an integral 
part of the learning pnx-ess. Individualization, flexible schedul- 
ing, open seliools, humane education, diagnostic teaching, and 
other programs that seek to tailor the learning situation to the 
specific needs and styles of specific learners make possible, in fact 
require, a high degree of student self-evaluation. 

Several conditions are necessary for student self-evaluation 
to 1k» effective. The student should !>e in a climate that encour- 
ages self-direction. Kach student should have a role in establishing 
he r his learning goals, and the teacher should furnish instruc- 
tional materials that have built-in evaluation aids. e.g.. balance* 
beams, new kinds of programmed !x>oks, ami old-fashioned an- 
swer hooks. 

Another important condition for effective* self k valuation is 
the opportunity for frequent individual student-teacher confer- 
ences, for self-evaluation does not eliminate the need for teacher 
guidance* or teacher evaluation, in fact, teachers who promote 
student self-direction and self-instruction are cautioned to pre- 
pare students for such n*ponsihility gradually and systematically. 
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l>oris M. supports student self-direction as follows: 



W e now know that the necessary alternative is to come 
closer to helping e ach move ahead fairlv continuously at 
his own growing edge. Yet we an- still faced by the vir- 
tually insoluble proble ms of providing so much differ- 
entiation if the teacher has to plan it all. Self-direction, 
within a reasonable framework, seems the most effective 
solution. For it adds the learners perceptions and diag- 
nosis of his own ease* to the teachers broader understand- 
ings of the field as a whole. (22, p. 77) 
Uv says further, M If, when they art* on their own, tomorrow's 
citizens arc to keep themselves knowledgeable and informed so 
that they may continue to lie effec tive citizens of their world, 
they must begin to learn self-direetion now* (22, p. 78} 

Promising Trends in Evaluation 

Ole Sand summarizes promising trends in evaluation as being: 
From To 
!. tests as punishment evaluation as a stimulant, a 




6. evaluation onlv by 
the teacher 



5, narrow range of 
behaviors measured 



4. exams at the end of a 
course 



3. memory of the facts 



2. measurement by paper 
and pencil tests 



humane guide to continued 
growth and learning 
a variety of evaluation tech- 
niques with emphasis on 
observation 

focus on creativity and inquiry 
cooperative and continuous 
evaluation 

evaluation of cognitive, affec- 
tive, and psychomotor behaviors 
self-evaluation 



7. colleges setting 
"standards" for 
admission 



colleges cleaning up their sterile 
programs and working with 
schools to develop valid evalua- 
tion techniques. (30, p. 123) 
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